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Abstract 

Discrepancy measures how uniformly distributed a point set is with respect to 
a given set of ranges. There are two notions of discrepancy, namely continuous 
discrepancy and combinatorial discrepancy. Depending on the ranges, several possible 
variants arise, for example star discrepancy, box discrepancy, and discrepancy of half- 
spaces. In this paper, we investigate the hardness of these problems with respect to 
the dimension d of the underlying space. 

All these problems are solvable in time n'~'^'^\ but such a time dependency quickly 
becomes intractable for high-dimensional data. Thus it is interesting to ask whether 
the dependency on d can be moderated. 

We answer this question negatively by proving that the canonical decision prob- 
lems are W[l]-hard with respect to the dimension. This is done via a parameterized 
reduction from the Clique problem. As the parameter stays linear in the input 
parameter, the results moreover imply that these problems require nP'^'^^ time, unless 
3-Sat can be solved in 2°^'^^ time. Further, we derive that testing whether a given set 
is an e-net with respect to half-spaces takes n^^'^'^ time under the same assumption. 
As intermediate results, we discover the W[l]-hardness of other well known problems, 

*Institut fiir Informatik, Preie Universitat Berlin, Takustrasse 9, D-14195 Berlin, Germany, 
pEinosQinf . f u-berlin . de 

^Institut fiir Informatik, Universitat Bayreuth, Universitatsstrasse 30, 95447 Bayreuth, Germany, 
christian . knauerOuni-bayreuth . de 

tThis research was supported by the German Science Foundation (DFG) under grant Kn 591/3-1. 

§Max-Planck-Institut fiir Informatik, Stuhlsatzenhausweg 85, 66123 Saarbriicken, Germany, 
wahlOmpi-inf .mpg.de 

^This research was supported by the German Research Foundation (DFG) via its priority program 
"SPP 1307: Algorithm Engineering", grant DO 749/4-1. 

"institut fiir Informatik, Freie Universitat Berlin, Takustrasse 9, D-14195 Berlin, Germany, 
daniel .wernerSfu-ber lin.de 

**This research was funded by Deutsche Forschungsgemeinschaft within the Research Training Group 
(Graduiertenkolleg) "Methods for Discrete Structures". 



1 



such as determining the largest empty star inside the unit cube. For this, we show 
that it is even hard to approximate within a factor of 2" . 

Keywords: discrepancy, epsilon-nets, geometric dimension, parameterized com- 
plexity, inapproximability. 

1 Introduction 

Geometric discrepancy has significant applications in several areas, including optimiza- 
tion, statistics, combinatorics, and computer graphics. See for example the textbooks 
by |MatlOj . |ChaOO] . and |DT97] . In particular, the star discrepancy of a point set is 
important in multi-variate numerical integration, where the error of a quasi-Monte Carlo 
integration is bounded as a function of the star discrepancy of the point set used in the 
integration (by the Koksma-Hlawka inequality, see |Nie92j ). In addition, the difficulty 
of computing the star discrepancy can be an obstacle to evaluating different methods for 
creating low-discrepancy point sets, see, for example, |DGW10] . 

Unfortunately, computing the star discrepancy of a point set using any known method is 
computationally intensive; given a set P of n points in d dimensions, every known method 
for getting even a constant-factor approximation of its star discrepancy has a running time 
of n^^^^ (see below). As many applications, for example in financial mathematics, require 
integration of functions in tens or even hundreds of dimensions, this quickly becomes 
infeasible. 

The main question we ask here is whether this dependency on d is necessary. Specifi- 
cally, we ask whether the decision version for star discrepancy (and other related problems) 
can be solved in 0{f{d)n'^) time, for some computable function / and some constant c in- 
dependent of d, i.e, whether it is fixed-parameter tractable with respect to d. Note that 
A^P-hardness for a problem does not exclude such a possibility. Proving that a problem is 
W[l]-hard with respect to d implies that such an algorithm is not possible, under standard 
complexity theoretic assumptions. 

1.1 Parameterized Complexity. 

We review some basic definitions of parameterized complexity theory; see, for example, 
one of the textbooks by |DF99j . |FG06j . and |Nie06j for an introduction. A problem with 
input size n and a positive integer parameter k is fixed-parameter tractable (fpt for short) 
if it can be solved by an algorithm that runs in 0{f{k) ■ n'^) time, where / is a computable 
function depending only on k, and c is a constant independent of k; such an algorithm is 
(informally) said to run in fpt-time. The class of all fixed-parameter tractable problems is 
denoted by FPT. 

An infinite hierarchy of classes, the W-hierarchy, has been introduced for establishing 
fixed-parameter intractability. Its first level, W[l], can be thought of as the parameter- 
ized analog of NP: a parameterized problem that is hard for W[l] is not in FPT unless 
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FPT=W[1], which is considered highly unhkely under standard complexity theoretic as- 
sumptions. Hardness is sought via an fpt-reduction, i.e., an fpt-time many-one reduction 
from a problem 11, parameterized with k, to a problem 11', parameterized with k', such 
that k' < g{k) for some computable function g. 



1.2 Discrepancy and epsilon-nets 

In this section, we define the basic notion of discrepancy. Let X be set and 7^ be a set of 
subsets of X, both not necessarily finite. A tuple (X, 71) is called a range space. 

If the range space arises from point sets and geometric objects, such as half-spaces or 
hyperrectangles (boxes), we call it a geometric range space. Often, as in our case, the 
ranges are given implicitly. As an example, for a point set P C M'^, we define 

Hp := {H n P I if is a half-space} . 

This is the range space induced by all half-spaces in M'^. Observe that if P is finite, even 
though there are infinitely many half-spaces, the size of TZ is at most 2l^L 

We will now define two different notions of discrepancy, namely the continuous discrep- 
ancy and the combinatorial discrepancy. 



1.2.1 Continuous Discrepancy 



This concept relates the volume (i.e., its Lebesgue-measure) of a point set to its discrete 
measure, i.e., to the fraction of points it contains. To simplify matters, we restrict ourselves 
to point sets in the d-dimensional unit cube. 

The intuition is the following: a range space should have high discrepancy either if 
there is a range with small volume that contains a large fraction of points, or if there is a 
range with large volume that contains a small fraction of points. In that sense, it measures 
how good a finite set of points approximates the uniform distribution. 



Definition 1. Let P C &e a set of points and TZ he a set of subsets of [0, 1]' 
the continuous discrepancy of P with respect to IZ as 



We define 



max 



vol{R) - 



IPnPI 



|P| 



Figure [T] shows two point sets in the plane with respect to axis-parallel boxes. The 
point set in Figure 1(a) shows a point set with low discrepancy. In 1(b) , the discrepancy is 



attained for a box of high volume and few points inside. In 1(c) it is attained for a small 
volume box with many points inside. 



1.2.2 Combinatorial Discrepancy 

The combinatorial discrepancy, sometimes called red-blue-discrepancy, is a slightly different 
notion. Here, we are given a set of points P, colored red or blue, and a set of ranges. Such 
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(a) low discrepancy set 



(b) large box, few points (c) small box, many points 



Figure 1: Continuous Discrepancy 

a set is said to have high discrepancy, if there is a range where the difference between red 
and blue points is high. 

Definition 2. Let P = Pb be a set of points in M'^, and let IZ he a set of subsets of 
P. We define the combinatorial discrepancy of P with respect to TZ as 

Dn (Pr,Pb) :=max||i?nPJ - |i?nPJ|. 

1.3 Epsilon-nets 

A theory closely related to discrepancy is that of e-nets. We will give some basic terminol- 
ogy and afterwards discuss the relation of our results to e-net problems. 

For a range space (P, 7?.) , a set C P is called an £-net if for aA\ R eTZ with |P| > £:|P| 
it holds that P fl 7^ 0. This means that S intersects all large sets in TZ, namely those 
that contain at least an e-fraction of the points. 

For applications, one is of course interested in nets that are small. For general range 
spaces, one can not expect such a behavior: If TZ is the power-set of P, any e-net must be 
of size at least n — en + 1. 

Surprisingly, there are many range spaces where the size of a net does not depend 
on the value of n. The Vapnik-Chervonenkis dimension has proved as a useful tool for 
studying these range spaces. Let (P, 7^) be a range space. For a set Q C P, let prQ(P) := 
{RnQ \ R E 7Z} denote the projection onto Q. We say that a set Q is shattered by 7Z, if 
prQ(P) = V{Q), i.e., if all subsets of Q appear in the projection of P onto Q. Now the 
VC-dimension 5 of a range space is the size of the largest set Q C P that is shattered by 
7^. 

|HW86j proved, based on a work by |VC71j . that range spaces of finite VC-dimension S 
admit e-nets of size O (f logf). Their result even proves something a lot stronger, namely 
that a random sample of that size will be an e-net with high probability. 



The theory of e-nets has been a hot topic recently, most notably because of the results 
by [Alollj and jPTlO] . 



1.4 Our results 

We study the decision versions of several problems related to the above definitions. In 
Section |2| we consider the following problem. 

Definition 3. frf-RED-BLUE-DlsCREPANCvj Let P = Pb be set of points in R'^, and 
A; G N. Decide whether DB{Pr,Pb) > k, where B is the set of all axis-parallel boxes inside 
the unit cube. 

In particular, we show the following. 

Theorem 1. The problem d-RED-BLUE-DlsCREPANCY is W[l]-hard with respect to the 
dimension and NP-hard. 

This result will be easily derived by first showing the W[l]-hardness of another well 
known problem, called Bichromatic- Rectangle, where we have to find a box that 
contains as many blue points as possible, but no red points. 

Subsequently, we will investigate the case of continuous discrepancy. Thereto, let Bq 
be the set of axis-parallel boxes inside the unit cube containing the origin. Such a box 
is called a star, or subinterval. In Section |4] and Section [6} we will consider the following 
problem. 

Definition 4. frf-STAR-DlsCREPANCY^ Let P be a set of points in and V be a rational 
number. Decide whether Disq{P) > V. 

The problem where the range space is the set of all axis-parallel boxes inside the unit 
cube B is defined analogously and will be called (i-Box-DlsCREPANCY. In Sections |4] and 
[6| we show the following. (The NP-hardness of star discrepancy was shown by [GSW09j .) 

Theorem 2. The problems rf-STAR-DlsCREPANCY and (i-Box-DlsCREPANCY are W[l]- 
hard with respect to the dimension and NP-hard. 

In order to prove these two theorems, we consider two related problems, which have 
also been studied in the past. 

Definition 5. fci-MAXlMUM-EMPTY-STARj Let P be a set of points in R'^ and V be a 
rational number. Decide whether there is a box of volume V inside the unit cube that 
contains the origin but no points from P. 

Analogously, we define the problem ci-MAXlMUM-EMPTY-Box. We establish the fol- 
lowing results. 

Theorem 3. The problems rf-MAXiMUM- EMPTY- Star and (i-MAXiMUM-EMPTY-Box 
are W[l]-hard with respect to the dimension. 
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For the ^-Maximum- Empty- Star problem we immediately get a result that is a lot 
stronger: 



Theorem 4. The problem cZ-Maximum-Empty-Star cannot be approximated in fpt time 
within a factor of 2^^^, unless FPT = W[l]. 

Afterwards, we sketch hardness results for some other range spaces, such as boxes or 
half-spaces. 

Finally, we build the connection to the theory of e-nets. It is known that for many sets, 
a small random sample will be an e-net with high probability. This leads to the following 
decision version. 

Definition 6. fc?-NET-VERlFlCATlONj Let P be a set of points in R'^, S be a subset of P, 
and an e > 0. Decide whether S is an e-net for P with respect to half-spaces. 

Our main theorem in this section shows that this question cannot be answered effi- 
ciently. 

Theorem 5. The problem d-NET- Verification is co-NP-hard and co-W[lj-hard with 
respect to the dimension. 

In all our reductions, the parameter d is kept linear in the input parameter. Using a 
result by |CCF+05j . we can derive something stronger. 

Corollary 1. All of the above problem cannot be solved in time n°^'^\ unless the Exponential 
Time Hypothesis fails, i.e., unless 3-Sat can be solved in 2°'-"') time. 

These results are obtained by fpt-reductions from the W[l]-complete /c-Clique problem 
in general graphs, see jDF99|, based on the general framework by |CGK"'"11[ ICGKR08] . 



1.5 Related work. 

When the dimension is part of the input, the Bichromatic- Rectangle problem was 
shown to be iVP-hard by (EHL"'"02] : in the same paper an 0(?T,^'^"'"^)-time algorithm was 
given. |BK09] gave an algorithm that runs in 0{k\og'^~^ n) time, where k is the number of 
feasible boxes that are not properly contained in any feasible box, and showed that k can 
be G(n^) in the worst case. |AHP08j gave an (1 — £:)-approximation algorithm that runs 
in 0(nr'^/2l(^-2iQg^^)M/2+il) time. 

The Star- Discrepancy problem has been shown to be iVP-hard by |GSW09j . An 
exact algorithm that runs in 

time was given by |DEM96j . |Thi01j has given an 
approximation algorithm that achieves additive error and runs in fpt-time with respect to 
the error and the dimension. However, as |Gne08j noted, by setting the error tolerance to 
the same order as the discrepancy of an optimal point set, so that a constant-factor ap- 
proximation is achieved, the running time of any algorithm following Thiemard's approach 
becomes rf"^'^\ As for the Box-Discrepancy, no hardness results where known so far. 
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The Maximum-Empty-Box problem has been studied extensively in the planar case, 
see for example |AS87j and references therein. When the dimension is part of the input, 
the problem has only recently been shown to be A^P-hard by |BK10] and the fastest exact 
algorithm runs in time 0(n'^log'^"^n) [BKlOj . Also recently, [DJ09] gave an 0{{8ede ' 
n log'^ ?T,)-time (1 — £:)-approximation algorithm for this problem. Note that, since (logn)'^ < 
n + f{k) for some /(fc), this counts as fpt time in parameters 1/e and ci, in contrast to our 
results for MAXIMUM- EMPTY- Star. The NP-hardness of the Maximum- Empty- Star 
problem was shown by |GSW09j . 

2 Red-Blue Discrepancy and the Bichromatic Rect- 
angle Problem 

In order to show the hardness of Red-Blue-Discrepancy, we will first consider the 
Bichromatic-Rectangle problem. The parameterized decision problem is defined as 
follows: 

Definition 7. (^/c-d-BlCHROMATlc-RECTANGLE^ Let Pr he a set of red points and a set 
Pb be a set of blue points in M."^, and k E N. Decide whether there is an axis-parallel box 
such that H n Pr = ^ and \H n Pb\ > k? 

A box that does not contain any point from Pr will be called feasible. For a given set 
of points P = Pr^ Pb, let 

EB(Pr,Pb)= max \BnPb\ 

B€B,BnPr=d 

denote the size of an optimal solution. Recall that B is the set of all axis-parallel boxes 
inside the unit cube. 

2.1 The idea. 

In order to show that the fc-d-BlCHROMATlc-RECTANGLE problem is iy[l]-hard, we will 
give a reduction from the fc-CLlQUE problem. For a given simple graph G = {[n],E) we 
will construct sets Pr = Pr{G, k) and Pb = Pb{G, k) in M^^ such that G has a clique of size 
k if and only if E{Pr, Pb) = k + 1. 

In addition to the (blue) origin 0, we will put blue and red points into k pairwise 
orthogonal two-dimensional planes. These points will be used to encode the vertices of G. 
Additional red points will then be used to encode the edge-set of G. 

Each plane will contain n blue points, corresponding to the vertices of the graph, and 
n — 1 red points. The red points are placed such that no feasible box can contain more than 
one blue point from a single plane. Thus, at most k of these blue points can be contained 
in any feasible box. We will then ensure that such a box can only contain points x and y 
from two different planes if the corresponding vertices are connected in G. This is done by 
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putting red points into the product of the respective planes (which is a four- dimensional 
subspace). 

This construction will ensure that any feasible box containing k + 1 blue points corre- 
sponds to a A;-clique in G, and vice versa. 



2.2 Preparations. 

For 1 < i < k, we define the two-dimensional subspace 

= {{xi,yi, . . .,Xk,yk) I xj = Vj = 0,j ^ 0} ^ 

For 1 < i < J < k, we set Rfj to be the product of and R^j, i.e., R^j = R^^ x M^, 

For p eR"^ and g G M^, observe that the unique point in Rj^ that (orthogonally) projects 

to p (into M^) and to q (into R'j) is p + q. 



2.3 The scaffold construction. 

Let £ = 1/4. For a vertex 1 < f < n, we define the point 

bi{v) = {v,n + l -v) e 

Then, let 

(n)r^°'' = {&.(i),...,&.(ri)}cM2 

be the set of all points in the i-th plane. Choosing a (rectangle containing) point bi{v) will 
correspond to choosing vertex v from G. Let 



pscaffold _ y 



scaffold 



l<i<k 

be the set of all these blue points. 

As we want the feasible boxes to contain at most one point from each Rf, we add a set 
of red points as follows: For 1 < w < n — 1, we define rj(t>) = (t> + 1/2, n + 1 — (f + 1/2)) 
and set 

(P,)r'^°i'^ = {r,(l),...,r,,(n-1)}CM2. 

Finally, we define 

pscaffold |_|_J ^p ^scaffold 

l<j<fc 

to be the set of all red scaffolding points. See Figure [2] for an example of the scaffold 
construction. Observe that an feasible box B can contain at most one blue point from 
each (Pb)f^'^°i'i. 
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Vi 
i\ 

• r,(l) 

; • ri{n - 1) 

'■ ► Xi 

Figure 2: The scaffold construction with vertex 2 selected. 

2.4 Encoding edges. 

In order to encode the edges of the graph, we will place several red points between pairs of 
Mfs. This will forbid certain pairs of blue points to be selected at the same time, namely 
the ones that correspond to vertices not being connected in G. 

Thereto, for 1 < i < j < k and vertices 1 < u,v < n, we define the point 

rf^{uv) = b,{u) + b,{v)eRtr 

The crucial property of such a point is that it is inside a box (containing the origin) if and 
only if both bi{u) and bj{v) are inside this box. 
The set of all killing points in M.fj is then 

iPr)fi = {rf'M,r^M\uv^E}. 

As the graph is simple (i.e., contains no loops), so all points of the form rij{uu) are also 
added. Finally, we set 

pf= 1+) ml 

to be the set of all killing points. See Figure [s] for an example where uv ^ E. 

2.5 The overall construction. 

For G = (\n\,E) and A; > we construct point sets Pr{G, k), Pb{G, k) in as follows: 
• Pr{G,k) = P^'^^°^'^UP^ 
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.kill 








m 




Ml) .\ 










bj{v) 

• 

• 

• ri{n — 1) 
• b,{n) 


V 



Figure 3: bi{u) is the projection of rfj^^{uv) to M.f and bj{v) is the projection of r^'"(-ut>) to 



^1- 



• Pb{G, k) = {0} U p^«^'^ff°id 

The size of the point set is 0{k'^n'^) and the coordinates of the points can be encoded by 
0{logkn) many bits. Clearly the construction can be performed in time polynomial in 
both k and n. 

Lemma 1. G has a k-clique if and only if EjsiPri Pb) = k + 1. 



Proof. First, observe that any feasible box B can contain at most k + 1 points, as |i? fl 
^p^^scafroid| < 1^ for 1 < i < A;. Additionally, can be in B. 

Let f 1, . . . , f fc be a clique of size k. We choose a (closed) box B with upper right corner 



hi{vi) in 



B contains exactly one point from each of the M^, and also the origin, making it a total of 
k + 1 points. We show that B is feasible. First, by definition B contains no point of p^'^^^'^^^. 



h{u)+bj{v) e iPr)tj. Then 



uv) 



Further, assume that B contains a point of P^^ , say rfj^^{uv) 

B contains both bi{u) and bj{v). But this means uv ^ E, for otherwise the point r^'^'l 
would not have been added; a contradiction. 

Now assume that there is no clique of size k. Let B be any box containing k + 1 points. 
We show that B is infeasible. If B contains a red point from one of the M^'s, we are done. 
Otherwise, besides the origin it can contain at most one blue point from each Mf. Let 
u = Vi and v = Vj be two vertices corresponding to blue points contained in B that are 
not connected in G. As there is no A;-clique, such a pair must exist. Then B also contains 



the red point ' 



kill, 



uv] 



Thus, B is infeasible. 



□ 



□ 



Theorem 6. The /c-rf-BlCHROMATIC- RECTANGLE problem is W[l]-hard when parameter- 
ized with both the dimension d and the size of the solution k. 
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As noted by |CT97] an optimization problem that is l^[l]-hard when parameterized by 
the size of the solution is unlikely to have an approximation scheme that runs in f{e)n'^, 
i.e, an efficient polynomial-time approximation scheme (EPTAS). In our case, since the 
problem is hard with respect to both the dimension and the size of the solution, this 
implies the following: 

Corollary 2. The (optimization version of the) A; -rf-BlCHROMATIC- RECTANGLE problem 
does not admit an approximation scheme that runs in 0{f{l/e,d) ■poly(n, c?)) time, unless 
W[l] = FPT. 

2.6 Adaption to Red-Blue Discrepancy 

In order to adapt this proof to the Red-Blue-Discrepancy problem, we have to modify 
the set in such a way that a clique corresponds to a set with high discrepancy. Thereto, 
let be the total number of points in the construction. We replace the blue point at the 
origin by A^ copies. Now the value 

Dn{Pr,Pb) :=max||i?nP^| - \Rr]Pb\\ 

is maximized for a box with many blue points inside (as it is at least A^). Observing that, 
for each M.f, the difference between blue and red points is at most one, we can follow the 
above reasoning. This means there is a box with N + k blue points (and no red points) 
inside, if and only if G has a fc-clique. This proves Theorem [TJ 

3 The Maximum Empty Star Problem 

We now turn to the continuous version of the problems. In this section, we consider the 
problem where we have to find an empty axis-parallel box inside the unit-cube of maxi- 
mal volume that contains the origin, namely Maximum- Empty- Star. Besides showing 
the W[l]-hardness, our construction yields that it is even iy[l]-hard to approximate this 

problem by a factor of (^)'^'- 

The proof uses ideas similar to the discrete case in Section [2j As above, because the 
origin has to be included in the boxes, the planes will be considered separately. In this 
construction, the analog of a rectangle containing a blue point from one of the Mf is now 
a rectangle that is "large" (of size C for some < C < 1 to be determined later). In 
each plane, there will be n large rectangles to choose from, corresponding to the n vertices 
of G. It will only be possible to choose large rectangles from two different planes, if 
the corresponding vertices are connected in G. This yields a one-to-one correspondence 
between "large" empty boxes and cliques of size k. 

3.1 The Construction. 

We will proceed as follows: First, we determine where the upper right corners of the n 
large rectangles have to be. From this, we will determine the blocking points (which are 
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the analog of the red points above) that are needed for this. 

Let /i > 1 be a parameter to be specified later. One possibility to determine the upper 
right corners of the rectangles, each having area C = in one M^, is as follows: 



cAu] 



u-l 



u-1 



,1 < u < n. 



We now place points such that any maximal empty (open) rectangle, i.e., a rectangle 
supported by two points, has its upper right corner at one Ci{u). This can be realized by 
the following blocking points: 



1 



, < M < n. 



scaffold 



l<i< 



P 



scaffold 



We set i^scaffoid = {p.[u)\0<u< n} and P 

Thus, in each M.f, we have n choices for the upper right corner of the rectangles: the 
points Ci{u), 1 < u < n. If a rectangle has its upper right point somewhere else on (x, C/x) 



or above, it contains a point from P 



scaffold 



and any other feasible rectangle has smaller 



size. 



Choosing a large rectangle in each of the M.f gives us an empty rectangle of total volume 
C''. See Figure [4] for an example. 



p,(0)x,(l) = (C7, 1) 



Pt{u - 1)\ q('u) 




c,(rz) = (l,C) 
Pi(n) 



(0,0) 



Figure 4: The plane M?. A rectangle selecting vertex u is indicated. 



3.2 Encoding the edges. 

As above, if the vertices corresponding to two different large rectangles in the planes 
and ]R| are not connected, we will add a point in the product M.fj that forbids these two 
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rectangles to be chosen at the same time. To this end, we let 

(these points are themselves not added to the set P) and then define ) = pY^^{u) + 

pY^^{v). Recall that this also includes all points of the form pij{uu). Then we set 

P'' = {pf{uv)\zj^j,uv^E}, 

and 

p p)E ^scaffold 

The size of P is 0{n'^k'^). If we set = 2, all coordinates have size polynomial in the size 
of the input. We also let V = C'' = 1/^'^'^"^^^ Clearly the construction can be performed 
in time polynomial in k and n. 

Now we come to prove the correctness of the construction. Let Fi{u) be the rectangle 
with corners pi{u — 1), Ci{u), Pi{u), pY^^{u), as indicated in Figure |4j 

Lemma 2. Any feasible rectangle in M? that does not intersect any region Fi{u), 1 < u < n, 
has size at most 

Proof. Such a rectangle has its upper right point below the graph going through the points 
Pi{u), 1 < u < n, which is (x, ^). □ □ 

We use this to prove the main Lemma, the continuous analog of Lemma [T] 

Lemma 3. G has a k-clique if and only if there is an empty star of size V = . Further, 
if G does not have a k-clique, the largest empty star has volume C^//i. 

Proof. Let fi, . . .ffc be a clique in G. In each M^, 1 < i < k, choose the rectangle with 
upper right corner [C ■ f^^^~^, ). Then the box is the product of these rectangles and it 
has volume C^. By definition, it does not contain any point from one of the M^. If it would 
contain a point pijiuv) G Mjj, then the projection of pij{uv) onto both and would be 
contained in the corresponding rectangles. But this means that uv ^ E, a contradiction. 

If there is no fc-clique, any selection of k large rectangles that do not contain a point 
in would contain a point in one of the M.fj, as among any k vertices there are at least 
two that are not connected. Thus, in order to avoid all points, by Lemma |2] in at least one 
M.f we cannot select a large rectangle intersecting any of the Fi{u). Thus, the total volume 
can be at most C"=-i ■ ^ = C"=/u. □ □ 

Thus, we have shown the first part of Theorem [3] 
3.3 An inapproximability result. 

Now it easily follows that the problem is even hard to approximate: the /i chosen above can 
be made very large. By Lemma [3} the ratio between a large box for a point set constructed 
from a positive instance and one constructed from a negative instance is at least /i. Since 
we can choose /i = 2*^*^l^l) (this takes only polynomially many bits). Theorem follows. 
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4 The Star Discrepancy Problem 



In this section we show that computing the star discrepancy of a point set inside the unit 
cube is Vr[l]-hard. 

There are two reasons why the previous reduction does not give us the hardness-result 
for this problem right away: 

• First, the maximum discrepancy can be attained by either a large box with few points 
inside or by a small box with many points inside. For example, in our construction 
from Section |3| large point sets lie in a box with (affine) dimension d — 1 and thus a 
volume of 0. 

• Second, even if the maximum is attained for a large box, it might still contain some 
points, in which case our construction would fail. 

However, we can get rid of both problems by simply choosing the right value for /i, and 
thus, C. Recall that these values determined the size of the largest empty box. C is exactly 
the area of a maximum empty rectangle in each M.f. 

For a graph G, let N be the total number of points in our construction from the previous 
section. Recall that N G 0{k'^n'^). Any box (containing the origin) that contains all points 
has volume of 1, as there are points with Xi = 1 and yi = 1 for all 1 < i < k. This leads 
to the following observation. 

Corollary 3. For all boxes B containing the origin we have 

|5nP| N-i 

' ' - vol{B) < 



N 

That means that the fraction of points in any box can be bigger by at most {N — 1)/N than 
its volume. 

Thus, our construction from Section [3] works if we can ensure that the largest empty 
box in a positive instance has volume more than (A^ — 1)/N, that means 



Then the discrepancy is attained for an empty box: Enlarging the box can increase the 
volume by at most 1 - But as > ^ = (1 - ^), we have that (1 - C") < 1/N. 
Thus, picking such an extra point cannot increase the discrepancy. This means that we 
can choose fi such that 

1 

1 < /i < ■ 



N- 1 



To make sure that /z requires only polynomially (in k and n) many bits, observe the 
following. 
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Lemma 4. For fi = 1 + ^ with t = 2knN , it holds that /i''^" < -j^. 
Proof. Observe that = Xli^o iwT- Then 

kn kn /, \ ^ kn 



Thus, 



t ^\2N N -I 

i=0 ^ ^ i=0 



□ □ 

Constructing the set P with this value of /i, we immediately get: 
Lemma 5. G has a clique of size k, if and only if Dq^^P) = . 

Proof. By the previous remarks, the maximum is attained for a large empty box. Then, 
the proof follows from Lemma |3} □ □ 

This proves the first part of Theorem |2] 

5 Largest Empty Box Problem 

In the two upcoming sections, we will consider the analogous problems for the case when 
the origin does not have to be contained in the boxes. That means our range space is 



B, the set of all axis-parallel boxes inside the unit cube, as defined in Section |1.2[ We 
start with the case where we have to find a large empty box inside the unit cube, namely 

ci-MAXIMUM-EMPTY-BOX. 

This problem is quite different in the sense that, as the box does not have to contain the 
origin, now the cannot be considered separately any more. This kills our construction 
from the previous section: the box (0, 1)^*^ does not contain any points from P but has 
volume 1. 

The plan is to reestablish this dependence, so that we can use the same reasoning as 
above. This can be done by a simple trick, which we call lifting: From a graph G, we first 
construct the set P as in Section [s] with the constant G'^ = 2/3. Then, define the function 
lift: M^fe ^ ^2fc follows: 



lift(a;i, . . . , X2k) = (x'l, . . . , x^^.) with x- 



Xi if Xj 7^ 

c'j = 1/2 otherwise. 



Now we apply the function lift to all points in the set P. For the lifted point x, we call the 
M? that the point was lifted from the corresponding M.f. This gives the following: 
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Lemma 6. Any box B E B having volume at least 2/3 contains a point x if and only if 
the projection onto the corresponding M? contains the projection of x. 

Proof. As the box has volume at least 2/3, each of its projections onto any of the has 
an area of at least 2/3, for otherwise the total volume would be less than 2/3 ■ 1. Thus, 
(1/2, 1/2) is contained in the projection of b to M^, for all 1 < j < k. 

This means that every large box (of volume at least 2/3) contains every point in all 
dimensions, except possibly for the corresponding M^. From this, the claim follows. □ □ 

Further, any box of volume 2/3 has its lower left endpoint inside [0,1/2)^'^. As all 
points lie inside [1/2, 1]^^, we can extend any large empty until its "lower left" corner is 
the origin. 

After these modifications, applying Lemma [6} we can use the same arguments as in 
Section [3} There is an empty box of volume if and only if G has a fc-clique. This proves 
the second part of Theorem |3] 



6 The Box Discrepancy Problem 

In order for our proof to work for this case, we will combine the ideas of the previous 
sections. Recall that we want to compute the box discrepancy 



D-jy (P) := max 
^ ' Ren 



vol{R) - 



\R^P\ 



\P\ 



of a point set P. 

To simplify our arguments, we make sure that any box containing all points has volume 
1. Thereto, we add one point at the origin and one point at (1, 1, . . . , 1). 

Now we construct the point set with the constants determined in Section |4] (with 
increasing by 2 because of the additional points). For this, we again choose C large enough 
so that the maximum is attained for a large box with no points inside, i.e., so that 



A^ 



A^ 



Finally, we lift all points (except for the origin) as in Section [5j Using the same arguments. 
Theorem |2] follows. 



7 Other geometric range spaces 

So far we have considered range spaces determined by (restricted or unrestricted) boxes 
inside the unit cube, namely B and Bq. Similar questions can be asked when the ranges are 
determined by other (geometric) objects. We give a few examples and a short discussion 
on how to adapt the proofs to these ranges. We restrict ourselves to the analog of the 
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bt(l) 



r+(l) 



■ • 



r~{n — 1) 



; • r+{n-l) 



Xi 



(a) Cubes 



Figure 5: Other range spaces 



r,{n + 1) 



(b) Half-spaces 



Bichromatic-Rectangle problem, from which the hardness of computing the combi- 
natorial discrepancy is easily derived. That means for a point set P = Pb and range 
space 7^, we consider the value 

ETi(Pr,Pb)-= max \RnPb\. 

R&R,RnPr=d 

Cubes If the range space is Q, the set of all cubes inside the unit square, we can adapt 



the construction as shown in Figure 5 (a 



Convex sets Here, the same arguments as in Section |2] work as well: Any convex set 
that does not contain any red points can contain at most one blue point from each M^. 
Further, also as a direct consequence of convexity, the encoding of the edges works as well. 



Half-spaces Instead of putting all points on a line, we now put all points on a convex 



curve as in Figure 5(b) Note the two additional red points on both ends to prevent 
and bi{n) to be chosen at the same time (by a hyperplane that does not contain the origin). 

Theorem 7. The problems d-CuBE-DiscREPANCY, rf-CoNVEX- Sets-Discrepancy and 
d-HALF-SPACE-DlsCREPANCY are W[lJ-hard with respect to d. 



8 Implications on verification of epsilon-nets 

We concentrate on the problem (i-NET- Verification for half-spaces. The proof can easily 
be adapted to the other range spaces considered. 
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Recall that, for given sets S" C P c M'' and an e > 0, we want to decide whether every 
half-space that contains at least e\P\ points also contains a point from S. 

To see why this problem is hard, consider the construction for the d-BlCHROMATlc- 
Rectangle problem from Section |2} There, we had a set Pr of red and a set Pb of 
blue points. We have shown that it is W[l]-hard to decide whether there is a half-space 
containing k blue and no red points. To reduce this problem to d-NET- Verification, 
we set P to be the set of all points, and S to be the set of red points P,.. Then, we set 
e = k/\P\. Now a half-space containing k points corresponds to a large set that is not 
intersected: it contains k = e\P\ blue points but no red points. But this means that S is 
not an £-net. This proves Theorem [5] 

9 Conclusion 

As the problems we have considered are all computationally hard when d is part of the 
input, we have to resort to approximation algorithms when dealing with them. We have 
seen that for the Maximum- Empty- Star problem, there is no hope for a polynomial 
time algorithm with a reasonable approximation factor at all. The (in)approximability 
of the (star) discrepancy is open even in the classical complexity theory framework. The 
question whether it can be approximated in FPT time when parameterized by dimension 
seems worthwhile considering. 
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